KURD - A Formalism for Shallow Post Morphological Processing

نویسندگان

  • Michael Carl
  • Antje Schmidt-Wigger
  • Munpyo Hong
چکیده

In most NLP applications an input text undergoes a number of transformations until the desired information can be extracted from it. Typically, such transformations involve part of speech tagging, morphological analysis such as lemmatization or full derivational and compositional analysis, context dependent disambiguation of tagging results, multi-word recognition, shallow, partial or full syntactic parsing, semantic analysis and so on. It is not always evident what level of analysis should be involved. For instance, whether a certain task requires a full parse or whether some `shallow' operations may be su cient is often difcult to determine. The choice of the involved tools can be guided by the data or the requirements or premises of the goal to be reached. These considerations may depend on the availability of a grammatical model, the required standard of the results and processing time constraints. However, the optimization of this task remains an unresolved area until now. The interest of the NLP community for 'shallow' processing has grown recently (cf. (Skut et al., 1997),(Abney, 1996)). In this paper, we describe a simple formalism (KURD) that is designed to perform some `shallow' operations on morphologically analyzed texts. The output can be used directly, or be redirected to further (e.g. linguistic or statistic) processing.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Shallow Post Morphological Processing with KURD

In this paper we describe a constraint based formalism that manipulates sequences of morphological analyses in order to Kill, Unify, Replace or Delete parts of the structure. We compare the formalism to a similar approach (CGP) and describe two applications.

متن کامل

Spejd: A Shallow Processing and Morphological Disambiguation Tool

This article presents a formalism and a beta version of a new tool for simultaneous morphosyntactic disambiguation and shallow parsing. Unlike in the case of other shallow parsing formalisms, the rules of the grammar allow for explicit morphosyntactic disambiguation statements, independently of structure-building statements, which facilitates the task of the shallow parsing of morphosyntactical...

متن کامل

Looking for Errors: A Declarative Formalism for Resource-adaptive Language Checking

The paper describes a phenomenon-based approach to grammar checking, which draws on the integration of different shallow NLP technologies, including morphological and POS taggers, as well as probabilistic and rule-based partial parsers. We present a declarative specification formalism for grammar checking and controlled language applications which greatly facilitates the development of checking...

متن کامل

Looking for Errors

The paper describes a phenomenon-based approach to grammar checking, which draws on the integration of different shallow NLP technologies, including morphological and POS taggers, as well as probabilistic and rule-based partial parsers. We present a declarative specification formalism for grammar checking and controlled language applications which greatly facilitates the development of checking...

متن کامل

CATCG: Un sistema de análisis morfosintáctico para el catalán

CATCG is a shallow parser for Catalan. It uses the Constraint Grammar formalism and contains three basic tools: a morphological analyser, a POS tagger and a shallow parser.

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2002